Method and Apparatus for Navigating a Video Via a Transcript of Spoken Dialog

ABSTRACT

A method and apparatus for navigating a media program via a searchable transcript of the dialog of the media program is disclosed. In one embodiment, a textural transcript of the dialog is generated, wherein the textural transcript comprising a plurality of portions wherein each portion is associated with a segment of the media program, a command is accepted to display the transcript and in response to that command, user interface data is transmitted to the client computer for presentation in a user interface, wherein the user interface comprising a concurrently presented media program player and the textural transcript.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No.12/635,605, entitled “USER INTERFACE INCLUDING CONCURRENT DISPLAY OFVIDEO PROGRAM, HISTOGRAM, AND TRANSCRIPT” filed on Dec. 10, 2009, whichis incorporated by reference herein.

This application is also related to the following co-pending andcommonly assigned patent application(s), all of which applications areincorporated by reference herein:

U.S. application Ser. No. 12/635,609, entitled “METHOD AND APPARATUS FORNAVIGATING A MEDIA PROGRAM VIA A HISTOGRAM OF POPULAR SEGMENTS” filed onDec. 10, 2009, by Eugene Chuan-Huai Wei, Varun Narang, Zhibing Wang,Yuming Liang, Xin Jin, Ting-hao Yang, Eric Feng and Andrew Cheng-minLin.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to systems and methods for streaming mediaprograms, and in particular to a system and method for visuallydepicting popular portions of a media program and navigating same.

2. Description of the Related Art

The dissemination and playback of media programs has undergonesubstantial changes in the past decade. Previously, media programs(which may include audio, video, or both) were disseminated either byanalog broadcast (conventional, satellite, or cable) or by disseminationof films to movie theaters.

These traditional dissemination and playback means remain in use afterthe advent of digital technology. However, digital technologies have hada profound effect on the dissemination and playback of media programs.

First, digital technology permitted the use of digital video recorders(DVRs). DVRs, while similar in function to standard analog videocassette recorders (VCRs), provide a number of additional usefulfunctions including live pause, the ability to record one program whileplaying back another, and the integration of the electronic programguides with DVR functionality (so that the recordation of media programscould be scheduled far in advance).

Second, digital technology also permitted the dissemination and playbackof media programs via the Internet, and with improved signal processingand more and more households with high-speed Internet access (e.g. DSL,fiber, and satellite). These methods of dissemination and playback havebecome competitive with traditional means. Dissemination of mediaprograms via the Internet may occur either by simple downloading,progressive downloading or streaming.

Simple downloading downloads the bytes of the media file in anyconvenient order, while progressive download downloads bytes at thebeginning of a file and continues downloading the file sequentially andconsecutively until the last byte. At any particular time duringprogressive downloading, portions of the file is not be immediatelyavailable for playback . . . the entire file must be downloaded firstbefore a media player can start playback.

For progressive downloading, a media file having the media program isdownloaded via the Internet using dial-up, DSL, ADSL, cable, T1, orother high-speed connection. Such downloading is typically performed bya web server via the Internet. In progressive downloading, media playersare able to start playback once enough of the beginning of the file hasdownloaded, however, the media player must download enough informationto support some form of playback before playback can occur. Playback ofprogressively downloaded media files is often delayed by slow Internetconnections and is also often choppy and/or contains a high likelihoodof stopping after only a few seconds. Once a progressively downloadedmedia program has been completely downloaded, it may be stored on theend-user computer for later use.

One of the disadvantages of a progressive downloading is that the entitytransmitting the data (the web server) simply pushes the data to theclient as fast as possible. It may appear to be “streaming” the videobecause the progressive download capability of many media players allowsplayback as soon as an adequate amount of data has been downloaded.However, the user cannot fast-forward to the end of the file until theentire file has been delivered by the web server. Another disadvantagewith progressive downloading is that the web server does not makeallowances for the data rate of the video file. Hence if the networkbandwidth is lower than the data rate required by the video file, theuser would have to wait a period of time before playback can begin. Ifplayback speed exceeds the data transfer speed, playback may be pausedfor a period of time while additional data is downloaded, interruptingthe viewing experience. However, the video playback quality may behigher when the playback occurs because of the potentially higher datarate. For example, if a 100 kbps video file can be delivered over a 56kbps modem, the video will be presented at the 100 kbps rate, but theremay be periods when playback will be paused while additional video datais downloaded. The video data is typically downloaded and stored as atemporary file in its entirety.

Web servers typically use HTTP (hypertext transport protocol) on top ofTCP (transfer control protocol) to transfer files over the network. TCP,which controls the transport of data packets over the network, isoptimized for guaranteed delivery of data, not speed. Therefore, if abrowser senses that data is missing, a resend request will be issued andthe data will be resent. In networks with high delivery errors, resendrequests may consume a large amount of bandwidth. Since TCP is notdesigned for efficient delivery of adequate data or bandwidth control(but rather guaranteed delivery of all data), it is not preferred forthe delivery of video data in all applications.

Streaming delivers media content continuously to a media player andmedia playback occurs simultaneously. The end-user is capable of playingthe media immediately upon delivery by the content provider. Traditionalstreaming techniques originate from a single provider delivering astream of data to a set of end-users. High bandwidths and centralprocessing unit (CPU) power are required to deliver a single stream to alarge audience, and the required bandwidth of the provider increases asthe number of end-users increases.

Unlike progressive downloading, streaming media can be deliveredon-demand or live. Wherein progressive download requires downloading theentire file or downloading enough of the entire file to start playbackat the beginning, streaming enables immediate playback at any pointwithin the file. End-users may skip through the media file to startplayback or change playback to any point in the media file. Hence, theend-user does not need to wait for the file to progressively download.Typically, streaming media is delivered from a few dedicated servershaving high bandwidth capabilities.

A streaming media server is a specialized device that accepts requestsfor video files, and with information about the format, bandwidth andstructure of those files, delivers just the amount of data necessary toplay the video, at the rate needed to play it. Streaming media serversmay also account for the transmission bandwidth and capabilities of themedia player. Unlike the web server, the streaming media severcommunicates with the client computer using control messages and datamessages to adjust to changing network conditions as the video isplayed. These control messages can include commands for trick playfunctions such as fast forward, fast reverse, pausing, or seeking to aparticular part of the file. Since a streaming media server transmitsvideo data only as needed and at the rate that is needed, precisecontrol over the number of streams served can be maintained. Unlike thecase with progressive downloading, the viewer will not be able to viewhigh data rate videos over a lower data rate transmission medium.However, streaming media servers (1) provide users random access to thevideo file, (2) allows monitoring of who is viewing what video programsand how long they are watched (3) use transmission bandwidth moreefficiently, since only the amount of data required to support theviewing experience is transmitted, and (4) the video file is not storedin the viewer's computer, but discarded by the media player, thusallowing more control over the content.

Streaming media servers may use HTTP and TCP to deliver video streams,but generally use RSTP (real time streaming protocol) and UDP (userdatagram protocol). These protocols permit control messages and savebandwidth by reducing overhead. Unlike TCP, when data is dropped duringtransmission, UDP does not transmit resent requests. Instead, the servercontinues to send data. Streaming media servers can also deliver livewebcasts and can multicast, which allows more than one client to tuneinto a single stream, thus saving bandwidth.

Typically, progressively downloaded media is transmitted to the clientcomputer at a rate that is faster than playback. The media programplayer buffers this data, and may indicate how much of the media programhas been buffered by providing an indicator, usually as a part of a“progress bar.” A control is often provided that allows the user to goto any point in the program that has already been buffered by selectingthe control and moving it to a different location along the progressbar. This allows the user to randomly access any buffered portion of themedia program.

A similar control is often provided with streaming media. However, withstreaming media, the control can be moved to any point in the program.If the control is moved to a temporal point in the program that has beenbuffered, playback can commence from that point, much like is the casewith progressively downloaded data. However, if the control is moved toa temporal point in the program that has not been buffered, the mediaprogram player transmits a control message having a request to thestreaming server to play the media program beginning at the temporallocation represented by the control. The streaming server receives therequest and transmits the media program to the client computer,beginning at the requested point.

One of the difficulties with the viewing of streamed and progressivelydownloaded media programs is that although the user may potentially movethe control on the progress bar to move forward or backward in the mediaprogram, the user is given little or no information about the mediacontent at the current location of the control. Hence, the user iscompelled to “hunt” for scenes that they are interested in. This is notonly inconvenient for the user, in the case of a streamed media program,the user's requests for different points in the media program can placean unnecessary burden on the streaming server. What is needed is amethod and apparatus for quickly navigating a media program.

SUMMARY OF THE INVENTION

A method and apparatus for navigating a media program via a searchabletranscript of the dialog of the media program is disclosed. In oneembodiment, a textural transcript of the dialog is generated, whereinthe textural transcript comprising a plurality of portions wherein eachportion is associated with a segment of the media program, a command isaccepted to display the transcript and in response to that command, userinterface data is transmitted to the client computer for presentation ina user interface, wherein the user interface comprising a concurrentlypresented media program player and the textural transcript.

BRIEF DESCRIPTION OF THE DRAWINGS

Referring now to the drawings in which like reference numbers representcorresponding parts throughout:

FIG. 1 is a diagram illustrating an exemplary media program system;

FIG. 2 illustrates an exemplary computer system that could be used toimplement elements the present invention;

FIG. 3 is a diagram illustrating a content delivery subsystem andtop-level operations that can be used to deliver media programs andadvertisements for presentation to a user;

FIG. 4 is a diagram illustrating the streaming of a media program;

FIG. 5 is a diagram presenting exemplary method steps that can be usedto present a histogram to the user;

FIG. 6 is a diagram illustrating one embodiment of how the operationsdepicted in FIG. 5 can be implemented;

FIG. 7 is a diagram illustrating a typical embodiment of at least aportion of a displayed histogram;

FIG. 8A is a diagram illustrating exemplary method steps in which thehistogram can be used to assist in the navigation of the media program;

FIG. 8B is a diagram illustrating exemplary method steps that can beused to generate the thumbnails from which representative thumbnails areassociated with media program segments;

FIG. 9 is a diagram illustrating one embodiment of a user interfacecomprising a histogram and a thumbnail;

FIG. 10 is a diagram illustrating exemplary steps that can be used topresent a media program transcript to the user;

FIG. 11 is a diagram presenting an exemplary user interface by which theuser may elect to view the transcript of a media program;

FIG. 12A is a diagram presenting exemplary method steps that can be usedto generate the textural transcript;

FIG. 12B is a diagram presenting exemplary steps that can be used toperform a keyword search of the transcript of the media program;

FIG. 13 is a diagram illustrating the user interface shown in FIG. 11after the entry and search for keywords;

FIG. 14 is a diagram of an embodiment of a histogram, showing the use ofhistogram tags; and

FIG. 15 is a diagram presenting an exemplary user interface 1500 forcreating code that can be used to embed a selected portion of the mediaprogram into a webpage

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

In the following description, reference is made to the accompanyingdrawings which form a part hereof, and which is shown, by way ofillustration, several embodiments of the present invention. It isunderstood that other embodiments may be utilized and structural changesmay be made without departing from the scope of the present invention.

FIG. 1 is a diagram illustrating an exemplary media program system 100.In the illustrated embodiment, the system 100 may comprise one or moremedia program sources 120A, 120B (alternatively referred to as mediaprogram source(s) 120) communicatively coupled to a communicationnetwork 104 such as the Internet. Each of the media program sources 120includes one or more source media servers 122A, 122B (alternativelyreferred to hereinafter as source media server(s) 122). In oneembodiment, the source media servers store databases of information,including the media programs.

The media program system 100 further comprises and a media programprovider 110, communicatively coupled to the communication network 104,and having one or more provider media servers 112 and one or moreprovider web servers 114. In one embodiment, the media program provider110 is a video-on-demand and/or streaming media program provider.

The media program system 100 may stream media programs to the clientcomputers 102A-102C (hereinafter alternatively referred to as clientcomputer(s) 102) directly from the media program provider 110, or themedia program provider 110 may operate as a portal, providing aninterface to the media programs available from the media program sources120, but not the media program itself (which is instead provided by themedia program source(s) 120).

In the first case, the media program provider 110 licenses mediaprograms from the media program sources 120 (such as www.fox.com orwww.nbc.com), and metadata for such programs is also typically providedto the media program provider 110 from the media program source 120 aswell. Such metadata can be retrieved by a database maintained by themedia program provider 110. If supplementary metadata is required, itcan be obtained from a metadata source 130 independent from the mediaprogram provider 110 and the media program source 120.

In the second case, the media programs are streamed to the clientcomputers 102 directly from the servers 122 of the media program source120. When the media program is streamed directly from the media programsource 120, it is often the case that the metadata provided by the mediaprogram source 120 is insufficient. In such cases, supplementarymetadata may be obtained from independent metadata source 130 (such aswww.tv.com or www.imdb.com) or other third party sources. In thiscircumstance, the role of the media program provider 110 is that of aportal that provides users 132A-132C (hereinafter alternatively referredto as user(s) 132) a list of available media programs and an interfaceto search to find such programs and to view them.

Media programs and metadata may be obtained via a communication network104 such as the Internet, or through auxiliary (and/or dedicated)communication links 134). Such information may be obtained by webcrawling (for example, using a program or automated script that browsesthe World Wide Web in a methodical, automated manner).

Using a client computer(s) 102, remote user(s) 132 can communicate withthe media program provider 110 using the communication network 104, toobtain media programs (including video-on-demand and/or streaming videoservices) and to search provider media program databases to find mediaprograms of interest.

The media program system 100 may also comprise one or more advertisementprovider 140, which supply advertisements that are replayed inconnection with the media programs provided by the media programprovider 110 or media program sources 120. In the illustratedembodiment, the advertisement provider 140 includes an advertisementprovider server 142 communicatively coupled to an associated memory,which may store a database of advertisements.

Advertisements may be supplied from the advertisement provider 140 tothe media program provider 110 via the Internet 104, a dedicated link146, or by physical exchange of a memory storage device having theadvertisement. Such advertisements can be provided to and stored by themedia program provider 110 and streamed or downloaded along with themedia program to the client computer(s) 102 at the appropriate time.

In one embodiment, the advertisements are integrated with the streamedor downloaded video from the media program provider 110. In anotherembodiment, the advertisements are not integrated with the mediaprogram, but are instead transmitted to the client computer(s) 102separately from the media program, and replayed at the appropriate timeusing indices that indicate when each advertisement should be presented.For example, advertisements can be indexed and streamed or downloaded tothe client computer(s) 102 (from the media program provider 110 or theadvertisement provider 140) and such advertisements can be played backto the user(s) 132 at times indicated by corresponding indices in themedia program.

FIG. 2 illustrates an exemplary computer system 202 that could be usedto implement elements the present invention. The computer 202 comprisesa general purpose hardware processor 204A and/or a special purposehardware processor 204B (hereinafter alternatively collectively referredto as processor 204) and a memory 206, such as random access memory(RAM). The computer 202 may be coupled to other devices, includinginput/output (I/O) devices such as a keyboard 214, a mouse device 216and a printer 228.

In one embodiment, the computer 202 operates by the general-purposeprocessor 204A performing instructions defined by the computer program210 under control of an operating system 208. The computer program 210and/or the operating system 208 may be stored in the memory 206 and mayinterface with the user(s) 132 and/or other devices to accept input andcommands and, based on such input and commands and the instructionsdefined by the computer program 210 and operating system 208 to provideoutput and results.

Output/results may be presented on the display 222 or provided toanother device for presentation or further processing or action.Typically, the display 222 comprises a plurality of picture elements(pixels) that change state to collectively present an image to theuser(s) 132. For example, the display 222 may comprise a liquid crystaldisplay (LCD) having a plurality of separately addressable pixels, eachwith a liquid crystal that changes to an opaque or translucent state toform a part of the image on the display in response to the data orinformation generated by the processor 204 from the application of theinstructions of the computer program 210 and/or operating system 208 tothe input and commands. Similarly, plasma displays include a pixelhaving three separate subpixel cells, each with a different colorphosphor. The colors blend together to create the color presented in thepixel. Pulses of current flowing through the cells are varied accordingto the data generated by the processor from the application of theinstructions of the computer program and/or operating system 208 inresponse to input and commands, changing the intensity of the lightprovided by the pixel. Also, similarly, cathode ray tube (CRT) displaysinclude a plurality of pixels, each with each pixel having subpixelstypically represented by dots or lines from an aperture grille. Each dotor line includes a phosphor coating that glows when struck by electronsfrom an electron gun. In response to the data generated by the processorfrom the application of instructions of the computer program and/oroperating system 208 and in response to input and commands, theelectrons emitted by the electron gun are steered at the dots or lines,thus changing the state of the associated pixel by causing the phosphorcoating of that dot or line to glow.

The image may be provided through a graphical user interface (GUI)module 218A. Although the GUI module 218A is depicted as a separatemodule, the instructions performing the GUI functions can be resident ordistributed in the operating system 208, the computer program 210, orimplemented with special purpose memory and processors.

Some or all of the operations performed by the computer 202 according tothe computer program 110 instructions may be implemented in a specialpurpose processor 204B. In this embodiment, the some or all of thecomputer program 210 instructions may be implemented via firmwareinstructions stored in a read only memory (ROM), a programmable readonly memory (PROM) or flash memory in within the special purposeprocessor 204B or in memory 206. The special purpose processor 204B mayalso be hardwired through circuit design to perform some or all of theoperations to implement the present invention. Further, the specialpurpose processor 204B may be a hybrid processor, which includesdedicated circuitry for performing a subset of functions, and othercircuits for performing more general functions such as responding tocomputer program instructions. In one embodiment, the special purposeprocessor is an application specific integrated circuit (ASIC).

The computer 202 may also implement a compiler 212 which allows anapplication program 210 written in a programming language such as COBOL,C++, FORTRAN, or other language to be translated into processor 204readable code. After completion, the application or computer program 210accesses and manipulates data accepted from I/O devices and stored inthe memory 206 of the computer 202 using the relationships and logicthat was generated using the compiler 212.

The computer 202 also optionally comprises an external communicationdevice such as a modem, satellite link, Ethernet card, or other devicefor accepting input from and providing output to other computers.

In one embodiment, instructions implementing the operating system 208,the computer program 210, and the compiler 212 are tangibly embodied ina computer-readable medium, e.g., data storage device 220, which couldinclude one or more fixed or removable data storage devices, such as azip drive, floppy disc drive 224, hard drive, CD-ROM drive, tape drive,DVD, etc. Further, the operating system 208 and the computer program 210are comprised of computer program instructions which, when accessed,read and executed by the computer 202, causes the computer 202 toperform the steps necessary to implement and/or use the presentinvention or to load the program of instructions into a memory, thuscreating a special purpose data structure causing the computer tooperate as a specially programmed computer executing the method stepsdescribed herein. Computer program 210 and/or operating instructions mayalso be tangibly embodied in memory 206 and/or data communicationsdevices 230, thereby making a computer program product or article ofmanufacture according to the invention. As such, the terms “article ofmanufacture,” “program storage device” and “computer program product” asused herein are intended to encompass a computer program accessible fromany computer readable device or media.

Of course, those skilled in the art will recognize that any combinationof the above components, or any number of different components,peripherals, and other devices, may be used with the computer 202.

Although the term “client computer” is referred to herein, it isunderstood that a client computer(s) 102 may include portable devicessuch as cellphones, portable MP3 players, video game consoles, notebookcomputers, pocket computers, personal data assistants (PDAs) or anyother device with suitable processing, communication, and input/outputcapability.

FIG. 3 is a diagram illustrating a content delivery subsystem (CDS) 300and top-level operations that can be used to deliver media programs andadvertisements for presentation to a user(s) 132.

In this embodiment, the content delivery subsystem 300 includes theclient computer(s) 102, the media program provider 110, and may includean advertisement provider 140. Although the advertisement provider 140is illustrated as a separate architectural entity than the media programprovider 110, the advertisement provider 140 may be integrated withinthe media program provider 110. The CDS 300 provides a means to providemedia programs and advertisements across a plurality of distributionnetworks, which may include www.hulu.com, www.imdb.com, www.aol.com orwww.msn.com. Metadata related to media program and advertisement contentis stored in the content delivery system 300, as is data describingwhere the media programs and advertisements may be found within the CDS300.

The client computer(s) 102 includes an interface module 302 and a mediaprogram player 304. The interface module 302 includes instructionsperformed by the client computer(s) 102 that are used to presentinformation to the user(s) 132 and to accept user input, includingcommands to select, play, and navigate the media program. The presentedinformation may include a program guide, the histogram and transcriptdescribed below.

In the illustrated embodiment, the web server 114 comprises a feedservice 306 and a content selector 308 communicatively coupled to acontent management service (CMS) 310. The media server 112 comprises amedia service 309 communicatively coupled to a memory 114 to securelystore the media programs.

Using the client computer(s) 102 and the interface module 302, the user132 enters a choice for a media program. The user interface module 302transmits a request message to the feed service 306 for a programidentifier (PID). The PID is used to identify content such as a specificmedia program running on a specific distribution network. For example,season 2 of “The Office” running on www.hulu.com will have a differentPID than the same exact show running on www.msn.com.

The feed service 306 then returns the PID to the user interface module302. The PID and associated commands (e.g. the play command) areprovided to the media program player 304. The media program player 304may be embedded in a webpage from a third party, or may be part of awebpage maintained by the media program provider 110, or may beimplemented in a standalone application.

The media program player 304 transmits the PID to a content selector308. The content selector 308 identifies the actual path (uniformresource locator or URL, for example) to the requested media programassociated with the transmitted PID, as well as programming metadatathat is used by the media program player 304 to help identifyrestrictions (for example, preventing access to adult-themed content ifthe user 132 is known to be under 18), advertising targeting rules aswell as advertising breaks. The advertising breaks indicate whereadvertisements may be inserted, and can be specified in terms of thetime since the beginning of the media program, the time remaining, orany other suitable measure. The URL and the metadata are thentransmitted to the media program player 304, along with otherinformation that is provided to allow the user 132 to select andnavigate media programs.

The media program player 304 provides a video request to a media server115 at the URL received from the content selector 308. The video requestincludes the PID and may include other information as well. The videorequest may be for streaming or progressively downloading the requestedmedia program.

The media server 112 responds to the video request by providing themedia program from a media service 309 to the client computer 102. Inone embodiment, the media server 112 is a service provider that has awide range of geographically distributed computers that store replicatedcopies of media programs and website content. The media service 114 canbe implemented by a third party provider (e.g. only contractuallyassociated with the media program provider 110), or can be part of themedia program provider 110. Geographically diverse servers assures thatwhen the user 132 requests playback of a media program, the mediaprogram is streamed from a geographically local server, to increasemedia program playback performance. Multiple servers also offerredundancy.

The media service 309 obtains the media program from secure storage 312which may be disposed in the media program provider 110 facility, orwhich may be disposed at a third party facility. The content managementservice 310 interfaces with the feed service 306, the content selector308, and secure storage 312 to manage which media programs are providedto the media service 114. External entities 314 can include thirdparties such entities that provide the advertisements that are to bedisplayed to the user(s) (which includes sponsors and/or theiradvertising agencies), third party providers of media programs, andentities from which the user(s) 132 may purchase goods or services.

The media program is transmitted to the media program player 304. Asdescribed above, the media program may be streamed or progressivelydownloaded to the media program player 304. In one embodiment, the mediaprogram player 304 plays the media program, and at the times identifiedin the advertising breaks defined above, requests advertisements fromthe advertising provider 140 using a path (e.g. URL) provided either bythe content selector 308 or the media server 112. The advertisingprovider 140 streams the requested advertisement is streamed to themedia program player 304 at the appropriate time, where it is displayedto the user(s) 132. In another embodiment, the media server 112 obtainsthe advertisements from the advertising provider 140 and inserts theadvertisements in the media program before the media program is streamedor progressively downloaded to the media program player 304.

Media Program Streaming

As described above, media program streaming allows a user(s) 132 toselect a media program (for example, by selecting a link on a referringwebsite), thus requesting a media network to stream the media program tothe viewer's computer as though it was coming from the referring websiteitself. Because the media files are typically hosted on a separatenetwork and provided by a separate media server, the streaming haslittle or no impact on existing web servers.

Media program streaming breaks the media program into segments sizedaccording to the bandwidth available between the client and the mediaserver. When the client computer has received enough segments, thesoftware can be playing one packet, decompressing another, anddownloading the third. This enables the user(s) 132 to listen or viewthe real-time file almost immediately, and without downloading theentire media file. This applies to live data feeds as well as storedmedia programs.

FIG. 4 is a diagram illustrating the streaming of a media program. Aclient media player 304 running in a client computer such as the clientcomputer(s) 102 issues a session request to a web server 406 to initiatea streaming session. The web server 406 responds by transmitting asession description to the client media player 304. Using the sessiondescription, the client computer(s) 102 then sets up a session with themedia server 112.

Data delivery control messages are then exchanged between the clientcomputer media player 304 and a media service 309 of the media server112 via a streaming protocol. The data delivery control messages controlthe request and delivery of data segments 404, each comprising aplurality of data packets that that together comprise the media program.Based on the data delivery control messages, the data segments 404 areretrieved from a media database 312 in which streaming media programsare stored, and transmitted from the server player 412 to the clientmedia player 304 via a transport protocol.

The stored streamed media programs are created by the process ofcapturing the video from an analog source and storing it to a disk. Thisis usually accomplished with a media player 420 providing a signal tovideo capture device 402 such as a computer having a video capture cardand the appropriate capture software. The video capture device may alsosupport the delivery of “live” video in addition to the storage of mediaprograms for later streaming.

After storage, the media program can be edited. An authoring tool mayalso be used to integrate the media program with other multimedia, ifdesired. The edited and integrated media program is then encoded to theappropriate streaming file format and stored in the database 312.Typically, this involves specifying the desired output resolution, framerate, and data rate for the streaming video file. When multiple datarates need to be supported (e.g. when the bandwidth of thecommunications link between the media server 112 and the client mediaprogram player 304 is measured and used to select the achievable datarate), multiple files may be produced corresponding to each data rate.Alternatively, one encoded media program can be produced that has adynamic bandwidth that can be adjusted to the bandwidth supported by thelink between the client media program player 304 and the media server112.

Using the data delivery control messages, the media server 112 managesthe delivery of the media program to the client media program player 304using the appropriate network transport protocols. Typically, the mediaserver 112 comprises a hardware platform that has been configured forthe delivery of real-time video plus media server software that runsunder an operating system that regulates the delivery of media programstreams.

The client media program player 304 receives and buffers the mediaprogram stream and plays it in an appropriate sized window on the clientcomputer(s) 102, typically via a videocassette recorder-like userinterface. Client media players 304 generally support such functions asplay and stop, as well as trick play function such as fast forward, fastreverse, and seek. The client media player 304 can be stand-alone mediaplayer or may be a browser 418 plug-in or an Active-X control.

The protocols used to stream media programs may include:

Hypertext Transfer Protocol (HTTP): An application-level protocol fordistributed, collaborative, hypermedia information systems. It is ageneric, stateless, object-oriented protocol that can be used for manytasks, such as name servers and distributed object management systems,through extension of its request methods. As shown in FIG. 4, thesession request will typically conform to HTTP.

Session Description Protocol(SDP): STP is a media description formatused to describe multimedia sessions for the purposes of sessionannouncement, session invitation, and other forms of multimedia sessioninitiation. In the example shown in FIG. 4, the session description mayconform to SDP.

Real Time Streaming Protocol (RTSP): RSTP is an application-levelstreaming protocol that is used to control the delivery of data withreal-time properties such as the media program itself. RTSP providescontrolled, on-demand delivery of audio and video. Typically, this isaccomplished using the Transmission Control Protocol (TCP). TCP managesmessage acknowledgement, retransmission, and timeout, and requireshandshaking to assure messages are delivered. If messages are lostduring transmission, retransmission of the lost message is requested.Once a connection is set up, user data may be sent bidirectionally overthe connection. TCP can be used to send commands from the client 404(such as “start” and “pause”) and sending commands from the media server112 to the client media program player 304 for specific information(such as the title of media programs and clips). RSTP messages mayinclude request messages and response messages. Request messages can besent by a client computer 102 to the media server 112 server or viceversa, and include the method to be applied to the resource and theidentifier of the resource. Response messages are transmitted inresponse to an RSTP request message. RSTP methods can include thefollowing:

DESCRIBE: The DESCRIBE method retrieves the description of a mediaprogram identified by the request (typically identified by the URLassociated with the media program). This function includes request andresponse pair that are used to initialize the link.

SETUP: The SETUP method specifies the transport mechanism to be used forthe streamed media program. Transport parameters acceptable to theclient media player 304 for data transmission are specified, and theresponse from the server includes the transport parameters selected bythe media server 112.

OPTIONS: The OPTIONS method requests available methods.

ANNOUNCE: The ANNOUNCE method requests a description of media object

PLAY: The PLAY method starts or repositions playback of the mediaprogram. The PLAY method tells the media server 112 to start sending themedia program data via the mechanism that was specified in the setupcommand. The PLAY request positions the normal play time to thebeginning of the range specified and delivers stream data until the endof the range is reached

REDIRECT: The REDIRECT method redirect client media player 304 todifferent media server 112.

PAUSE: The PAUSE method causes the stream delivery to be interrupted(halted) temporarily. If the request URL names a stream, only playbackand recording of the named stream is halted. For example, for audio,this is equivalent to muting. If the request URL names a presentation orgroup of streams, delivery of all currently active streams within thepresentation or group is halted. After resuming playback or recording,synchronization of the tracks is maintained. Media server 112 resourcesare maintained, though servers MAY close the session and free resourcesafter being paused for the duration specified with by timeout parameter,which may be provided in the message sent with the SETUP method.

SET PARAMETER: This method permits device or encoding control.

TEARDOWN: The TEARDOWN method stops the stream delivery for the mediaprogram associated with a given uniform resource identifier (URI),freeing the resources associated with it. If the URI is the presentationURI for currently streamed media program, any RTSP session identifierassociated with the session is no longer valid. Unless all transportparameters are defined by the session description, a SETUP request mustbe issued before the media program can be played again.

Real-time Transport Protocol(RTP): RTP is a user datagram protocol (UDP)packet format and set of conventions that provides end-to-end networktransport functions suitable for applications transmitting real-timedata, such as audio, video or simulation data, over multicast or unicastnetwork services. UDP is simple, connectionless protocol in whichinformation is transmitted in one direction from the source to thedestination without verifying the readiness or state of the receiver.

Real-time Control Protocol (RTCP): RTCP is the control protocol thatworks in conjunction with RTP. RTCP control packets are periodicallytransmitted by each participant in an RTP session to all otherparticipants. RTCP is used to control performance and for diagnosticpurposes.

Media programs may be transmitted on-demand, live, or simulated. live.When transmitted on-demand, a clip of a media program is available tothe user(s) 132 whenever the user(s) want it. The user(s) canfast-forward, rewind, or pause the clip, and media server 410 will sendthe portion of the media program that is requested. Typically, on-demandmedia programs are prerecorded or pre-assembled.

Live delivery allows the user(s) to tune in to the action that iscurrently being transmitted. Since the user(s) can not fast-forward orrewind through the clip, because the media program is being transmittedin real time. Delivery of content as a live event requires that the livemedia program be encoded for streaming video delivery. Live mediaprograms or clips can be delivered via unicasting, splitting, ormulticasting. Unicast media programs are simply transmitted to theclient as any other media program. Splitting allows the media server toshare its live media streams with other media servers. Clients connectto the other media servers rather than to the main media server wherethe streams originate. This allows the traffic load on the originatingmedia server to be reduced, allowing it to distribute other mediaprograms at the same time. Multicasting allows media programs to bedelivered to subsets of the uses sharing a common communication channel.

Simulated live delivery is analogous to the delivery of pre-recordedevents that are broadcast as if they were live events (e.g. thetransmission of sporting events into different time zones).

The present invention presents the user with additional information andallows that additional information to be used to navigate the mediaprogram. That additional information includes (1) a histogram indicatingthe relative popularity of different portions of the media program and(2) a searchable transcript of the dialog of the media program. In oneembodiment, both the histogram and the searchable transcript arepresented to the user 132 concurrently, and are linked so that userinput with regard to one is reflected in the other, but the histogramand the searchable transcript may be separately presented. Thesefeatures allow the user 132 to find a known portion of the media programof interest quickly, and also allows the user to find portions that maybe of interest.

Display of Histogram Showing Number of Views

FIG. 5 is a diagram presenting exemplary method steps that can be usedto practice one embodiment of the present invention. In block 502, afirst set of media program segments 140A are streamed to a first clientcomputer 102A. This can be initiated in accordance with a sessionrequest issued by the client computer 102A, or a trick play function(e.g. fast forward, seek, reverse, or pause) initiated by the userassociated with the client computer 102A. In block 504, a second set ofmedia program segments 140B are streamed to a second client computer102B. Generally, the first set of media program segments 140A aredifferent than the second set of media program segments 140B. Forexample, the first set of media program segments 140A may represent theportion of the media program extending from time 0:20:00 to time0:25:00, while the second set of media program segments 140B mayrepresent the portion of the media program extending from time 0:10:00to 0:12:00, or from time 0:17:00 to 0:23:00. However, first set of mediaprogram segments 140A and the second set of media program segments 140Bmay be the same media program segments.

Next, a histogram of the combined first set of media program segmentsand the second set of media programs is generated, as shown in block506. In block 510, user interface data is transmitted to the clientcomputer 102C for presentation in a user interface. The user interfacedata comprises the histogram having features such as bars that representthe media program segments. The histogram is preferably presentedconcurrently with the media program player 304 on the same display 222.As shown in block 508, the histogram 700 may be provided, for example,in response to a histogram request from the third client computer 102C,or may be provided any time the user 132 requests a streamed mediaprogram. Further, although the histogram request may come from a thirdclient computer 102C, it may come from the first client computer 102A,the second client computer 102B, or any client computer 102.

FIG. 6 is a diagram illustrating one embodiment of how the foregoingoperations can be implemented. Essentially, this can be accomplished by(1) monitoring the data delivery control messages exchanged between theclient computer 102 and the media server, and transmitting the monitoredinformation to the web server 114, or by (2) monitoring thetransmission/reception of the media program segments or packets by themedia server 112/client computer 102 and transmitting this informationto the web server 114.

The control messages may be monitored by the client computer 102 andtransmitted to the web server 114, or may be monitored by the mediaserver 112 and transmitted to the web server 114. Similarly, thetransmission of the media program may be monitored by the media server112 and transmitted to the web server 114, or the reception of the mediaprogram may be monitored by the client computer 102 and transmitted tothe web server 114.

Monitoring Data Control Messages

In one embodiment, information indicating which media segments have beenrequested and/or have been transmitted to the client media player 304 isobtained by monitoring the data delivery control messages between theclient media player 304 and the media server 410. This can beaccomplished by monitoring data delivery control messages to determinedirectly which segments were requested and acknowledged, or by inferringwhich segments were requested and transmitted by play and trick playcommands.

For example, if a PLAY command has been issued by the client mediaprogram player 304 to the media server 112, it can be assumed that allof the segments were transmitted and received by the client mediaprogram player 304 until a command is received that would halt deliveryof the segments (e.g. a PAUSE, STOP, or TEARDOWN command, for example).Similarly, it can be assumed that following a PLAY command, all segmentswere transmitted and received by the media program player 304 until atrick play command is received (e.g. moving a control on a progress barfrom on temporal location of the media program to another.

As another example, the client media program player 304 may generate aSEEK events such as SEEK_BACK and SEEK_FORWARD. A SEEK_BACK eventcommands the media program to begin replay of the media program from alocation that is temporally behind the current playback point. In oneembodiment, the current playback point is visualized as a cursor such asthe slider control 1118 depicted in FIG. 11 that can be moved forwardand backwards to navigate through different locations on the progressbar 1116, each of which represent a different temporal location in themedia program. A SEEK_BACK command is issued from the client mediaprogram player 304 if the slider control 1118 is selected and moved tothe left, and a SEEK_FORWARD command is issued from the client mediaprogram player 304 if the slider control 1118 is moved to the right. TheSEEK_BACK and SEEK_FORWARD commands specify the original cursor position(before the SEEK_BACK command is issued), and the desired cursorposition (after the SEEK_BACK command is completed). Using thisinformation, the SEEK_BACK command can be used to contribute a firstpopularity score the affected portion of the media program (temporallybetween the original cursor position and the current cursor position)and the SEEK_FORWARD command can be used to contribute a secondpopularity score (lower than the first popularity score) for theaffected portion of the media program. In one embodiment, the firstpopularity score can be a positive value and the second popularity scorecan be a negative value, and the positive and negative scores can besubtracted from a baseline score.

The monitoring of trick play functions rather than individual segmentsor blocks of segments also has other advantages. Since users 132generally use trick play functions to navigate to portions of the mediaprogram that they are interested in, it is reasonable to assume that theportion that is navigated to was a portion of significant interest tothe user 132. The same is not necessarily true for users who simply playthe media program from start to finish. A user 132 who has startedplayback of a media program and entered no further commands until themedia program has completed playback is likely less interested in anyparticular portion of the media program than a user that used a trickplay function to view the same portion of the media program, and may notbe watching the media program at all. However, a user 132 who isnavigating through the media program through a series of trick playfunctions is very likely watching the media program, and very likelyinterested in the portion of the media program they are navigating to.Accordingly, in determining popular portions of the media program,heavier weight may be given to data regarding segments that are viewedafter one or more trick play navigation commands. Further, segmentsplayed between a series of trick play commands that are closelydistributed in time are an indication that the user is navigating themedia program to find a desired scene, and such segments can be ignoredand not counted in the playback statistics presented in the histogram700, if desired.

The monitoring and transmission of data delivery control messages to theweb server 114 may be implemented in the client computer 102 (via ahardware module, a routine in the client media program player 304, abrowser 418 plug-in, or an application in the client computer), or maybe implemented in the media server 112.

The data control messages may also be accompanied by informationdescribing the media program being viewed and the client computer 102 oruser 132, so that the data delivery control messages are distinguishablebetween client computers 102 and media programs.

Transmitting Playback Information to the Media Server

The monitoring of the data delivery control messages exchanged betweenthe client computer 102 and the media server 112 uses information thatis already exchanged for purposes of streaming the media program.However, information regarding which segments have been viewed can beobtained without resort to the data delivery control messages. Forexample, the client media program player 304 may simply transmit theappropriate information to the web server 114 via the user interfaceHTTP connection established during set up, or through a separatelightweight HTTP connection established for that purpose. Theinformation transmitted can be analogous to the data delivery controlmessages passed between the client media program player 304 and themedia server 112 (the SEEK_BACK and SEEK_FORWARD, for example), or myinstead comprise other information.

In one embodiment, a beacon service is implemented to transmit the mediaprogram segment viewing information to the web server 114. The beaconservice is an application that transmits beacon information to the webserver 114, either through an existing communication link such as theuser interface HTTP connection, or through a separate lightweight HTTPconnection. Beacon information can be inserted into the media programbefore the media program is streamed to the client computer 102. Forexample, beacon information may be inserted into or between frames orpackets every three minutes. When the client computer 102 plays theportion of the media program or encounters the packet, a beacon istransmitted from the client computer 102 to the web server. Thereception of the beacon indicates that an associated portion of thatmedia program has been played back. If the user were to issue aSEEK_BACK command, that beacon would be encountered again duringplayback and transmitted to the web server 114 again, thus indicatingthat the associated portion of the media program has been replayedagain. In this embodiment, the transmitted beacons may be unique suchthat they can be distinguished from one another (each beacon having adifferent identifier than any other beacon). Additional informationindicating the media program and the user may be transmitted, but isunnecessary so long as the transmitted beacon is globally unique.Alternatively, the beacon may be only unique to the media program (e.g.the first beacon may simply be the number “3” indicating that firstthree minutes of the media program has been replayed), and theinformation transmitted may include a media program identifier todistinguish it from beacons of other media programs.

Alternatively or in addition to the fixed beacons described above,beacon information can be generated by the beacon service based playbackof the media program or user commands. This “seek beacon” information isthen transmitted to the web server 114 and used to determine whichsegments of the media program have been replayed by the client computer102. For example, if the user 132 issues a SEEK_BACK command, a beaconhaving information regarding the original cursor position and the cursorposition after the SEEK_BACK command may be transmitted to the webserver 114. As another example, if a media program is shared betweenusers (e.g. user 132A sends a link to user 132B with the URL of theshared video), the second user's client media program player 304 sendsjust one SEEK_FORWARD event to the media server 112 requesting the mediaprogram at the start time. If no further information is receivedregarding the playback of that media program, it can be inferred thatthe second user played back the remainder of the media program. Asbefore the information may include a media program identifier so thatthe media program associated with the command may be determined, or theidentity of the media program may be inferred from.

Monitoring Media Program Segment Transmission or Reception

In another embodiment, the information may be obtained by monitoringwhich segments are actually transmitted by the media server 112 using ahardware, software, or firmware module in the media server 112, andtransmitting this information to the web server 114 or headend.Alternatively, information may be obtained by monitoring which segmentsare received by the client computer 102 and transmitting thisinformation to the web server 114 or headend. In this embodiment, amodule implemented in the client media player 304, browser 418, orelsewhere in the client computer 102 may transmit a message to the webserver 406 indicating which media program segments have been received,for example, using an Internet link and HTTP.

In any of the foregoing embodiments, the monitoring of the portions ofthe media program transmitted or received can be accomplished at apacket level, rather than a segment level. It is also possible toindicate which segments have been received merely by transmitting thebeginning and end time for the portions of the media program that havebeen received.

FIG. 7 is a diagram illustrating a typical embodiment of at least aportion of a displayed histogram 700, and illustrating one method forgenerating the histogram. The histogram comprises a plurality offeatures such as bars 701, each bar 701 representing one or moresegments of the media program. In the illustrated embodiment, each bar701 represents thirty seconds of the media program. The length of eachbar 701 is determined by the popularity “score” of the media programrepresented by the media segment represented by the bar 701. In oneembodiment, the length of the bar 701 may indicate the number of “views”of the media segment or media segments represented by the bar 701. Forexample, bar 701 is shorter than bar 706, to indicate that the segmentor segments represented by bar 701 were viewed fewer times than thesegment or segments represented by bar 706. Of course, the temporallength of the segments can be selected to be longer or shorter asdesired.

One technique in using trick play (e.g. SEEK) commands to determine thepopularity of a particular video segment is also illustrated in FIG. 7.Item 710 is a baseline histogram 710 wherein all of the segments aredeemed to be of equal popularity. SEEK_FORWARD and SEEK_BACK commandsare illustrated using open-tailed arrows 712A and 712B. Arrow 712Aillustrates a SEEK_BACK command from the temporal location indicated bythe open tail to the temporal location indicated by the solid arrow tip,whereas arrow 712B indicates a SEEK_FORWARD command from the temporallocation indicated by the open tail to the temporal location indicatedby the solid arrow tip. Arrows 712C and 712D indicate SEEK_BACK commandsthat may have been received from a different client computer 102 thanthe SEEK_BACK command represented by arrow 712A.

The popularity of each segment 701 can be determined by adding a “view”if the segment was viewed with a SEEK_BACK command, and subtracting a“view” if the segment was bypassed by a SEEK_FORWARD command. Forexample, in the embodiment illustrated in FIG. 7, segments 706 were thereplayed three additional times due to SEEK_BACK commands 712A, 712D,and 712C, and hence the corresponding features 706A representing thesegments in the histogram are increased in length to indicate threeadditional views over the baseline. Similarly, segments 702 were skippedby SEEK_FORWARD command 712B, and the corresponding features 702Arepresenting the segments in the histogram are decreased in length toindicate one less view than the baseline. Other embodiments arepossible, for instance, segments that are skipped via the SEEK_FORWARDcommand may simply be give no viewing score (rather than subtracting aview).

In one embodiment, the data regarding how many times a segment is viewedis received in a module in the web server 406, which compiles theinformation together to determine the total number of times each segmenthas been viewed, computes the histogram 700 and transmits the histogram700 to the client computer 102 for presentation, typically by a moduleresident in the browser 418 or client media program player 304. However,other embodiments are possible in which the foregoing operations areallocated between the client computer 102 and the web server 406 andmedia server 112 differently. For example, histogram statistics may begenerated by the media server 112 and transmitted to the web server 406.

Navigation of the Media Program via Histogram

FIG. 8A is a diagram illustrating exemplary method steps in which thehistogram can be used to assist in the navigation of the media program.In block 802, a representative thumbnail of the media program isassociated with each of the media program segments. Typically, the webserver 406 identifies a thumbnail that represents each of the segmentsof the media program.

In block 804, user input is accepted identifying one of the presentedmedia program segments. In block 806, the representative thumbnail ofthe media program associated with the identified one of the mediaprogram segments is displayed.

FIG. 8B is a diagram illustrating exemplary method steps that can beused to generate the thumbnails from which representative thumbnails areassociated with media program segments. In block 852, a frame isselected for each of the segments of the media program. In block 854, athumbnail is generated from each of the selected frames of the segments.

FIG. 9 is a diagram illustrating one embodiment of a user interfacecomprising a histogram 700 and a thumbnail 904. In this embodiment, whenthe user 132 manipulates the pointing device 216 of the client computer102 to hover the cursor 902 over a bar 906 representing one or moresegments of the media program, the bar 906 changes to delineate the barfrom the other bars, and a thumbnail 904 of a frame of one or more mediasegments is displayed adjacent the user selected bar. As the cursor 902is moved to hover over each of the bars of the histogram depicting thesegments, the displayed thumbnail 904 changes so as to display thethumbnail associated with the one or more segments. If desired, the timeof the segment can be displayed along with the thumbnail 904.

Using the pointing device 216 and the cursor 902, the user 132 may thenclick the bar 906 or the thumbnail, thus commanding the media player toplay the media program beginning at the portion of the media programassociated with the bar 906 or thumbnail 904.

Presentation of the Media Program Transcript

In one embodiment of the present invention, a transcript of the mediaprogram dialog can be presented, either in addition to, or in thealternative to the histogram 700 described above.

FIG. 10 is a diagram illustrating exemplary steps that can be used topresent a transcript of the dialog of a media program to the user. Inblock 1002, a textural transcript of the dialog is generated. Thetranscript includes a plurality of portions, each of which is associatedwith a segment of the media program. In one embodiment, the portions aredefined by closed captioning phrases. In block 1004, a command todisplay at least a portion of the textural transcript is accepted. Inone embodiment, the command to show the transcript is independent fromany previous commands. In another embodiment, the command to show thetranscript is received in response to a selection of one of the featureson a user interface, which may include the bars on the histogram. Instill another embodiment, the command to show the transcript provided byselecting a tab on the user interface. The command may also be a commandthat is received from a client computer 102 to select or view a mediaprogram, and the transcript is transmitted along with other data that isused to view and navigate the media program. In another embodiment, acommand to select or view the media program is already received, and theuser selects a further control in order to transmit the command to viewthe media program transcript. Examples are shown in FIG. 11.

Finally, in block 1006, the user interface data is transmitted to theclient computer 102 where it is displayed. The user interface dataincludes the requested portions of the textural transcript. The userinterface presents the textural transcript using the client computer102. In one embodiment, the textural transcript is presentedconcurrently with a media program player. Further, as described below,user interface elements such as the media program player, the histogram700, and the transcript may be synchronized so the selection of anavigation control of one of the elements is reflected in the otherelements.

For example, if the command to show the transcript was performed byselecting one of the bars of the histogram 700, the portion of thetranscript that is displayed is the portion associated with the portionof the program represented by the bar of the histogram. Further, ifmedia program player commands are used to navigate the media program,those commands are reflected in the histogram and the texturaltranscript.

FIG. 11 is a diagram presenting an exemplary user interface 1100 bywhich the user may elect to view the transcript of a media program. Theuser interface 1100 includes a media player window 1114 in which themedia program is rendered. The media player window 1114 includes aprogress bar 1116 which indicates which portion of the media program iscurrently being played. The user may grab a slider control 1118 and moveit left and right to begin play at an earlier or later point in themedia program, respectively. The user interface 1100 may also includemedia player controls 1120A-1120E (alternatively referred to hereinafteras media player control(s) 1120). The media player controls 1120 mayinclude a stop control 1120C, a play control 1120D, a pause control1120B, a fast reverse control 1120A and a fast forward control 1120E.

The interface 1100 also includes a plurality of tabs 1102A-1102E, whichallow the user to select information regarding the media program ofinterest. Tab 1102A allows the user to show default informationregarding the media program. Tab 1102B allows the user to view reviewsof the subject media program. Tab 1102C allows the user to view andparticipate in discussions with other users regarding the subject mediaprogram. Tab 1102D allows the user to view captions, or a transcript ofthe media program. Tab 1102E allows the user to create custom widgetsthat can be embedded in other websites that show the selected mediaprograms.

In the embodiment shown in FIG. 11, the user 132 has selected the“captions” tab 1102D. In response to this selection, at least a portionof a transcript 1104 of the media program is presented in the userinterface 1100. The transcript 1104 includes the phrases of the dialogand an adjacent time stamp, indicating when the phrase was uttered inthe media program. In the illustrated embodiment, thumbnails 1106associated with the phrases are also presented adjacent the phrase. Thethumbnails 1106 may be generated manually, or by software which capturesa frame temporally associated with the phrase of the transcript 1104.Further, the thumbnails 1106 can be generated in advance or in realtime, when the user 132 selects the captions tab 1102D.

The user 132 can navigate to different portions of the transcript 1104by selecting other pages, using controls 1108. Or, the user 132 cannavigate to other portions of the transcript 1104 by selecting a bar 902in the histogram 700. For example, if the user 132 moves the cursor overbar 902 that shows the relative number of views of a particular portionof the media program and selects that bar 902 (e.g. by clicking on it),the portion 1110 of the transcript 1104 associated with that particularportion of the media program is highlighted, as is the selected bar 902.If the selected bar 902 represents multiple phrases of the transcript1104, those multiple phrases may be highlighted.

In one embodiment, the histogram 700 and the transcript are synchronizedso that the same time interval of the media program is presented onboth. In this embodiment, the histogram 700 and transcript 1104 fordifferent portions of the media program can be viewed by manipulating acontrol on either the histogram 700 or the transcript 1104. For example,in the illustrated embodiment shown in FIG. 11, if the user 132 wishesto see the histogram 700 for portions of the media program after 0:40,the user may select histogram navigation button 1112, and the histogramwill show results for a different portion of the media program (e.g.from 0:05 to 0:45 instead of 0:00 to 0:40 as currently illustrated). Theportion of the transcript 1104 can be automatically changed to also showthe selected portion of the media program (e.g. 0:05 to 0:45).Alternatively, the user may navigate the transcript 1104 (e.g. usingtranscript control 1108). In response, the transcript 1104 for differentportions of the media program are presented, and the histogram 700 isupdated so as to be synchronized with the transcript.

In other embodiments, the histogram 700 may reflect a larger portion ofthe media program than the transcript. For example, the histogram 700may show the relative number of viewings for the entire media program.In this case, the histogram 700 may be used to navigate to differentportions of the media program (e.g. by clicking on one of the bars), andthe transcript will follow suit to show the dialog for the portion ofthe media program that was selected. In this embodiment, the histogram700 may include further controls that allow the user 132 to expand thehistogram 700 to show the entire media program or to contract thehistogram 700 to show data for only a portion of the media program.

Transcript information can be obtained in a number of ways. In oneembodiment, it is simply obtained from the closed captioning informationencoded into the media program, whether by analog (line 21 of thevertical blanking interval, as defined by EIA-608, which is herebyincorporated by reference herein) or digital (EIA 708B, which is herebyincorporated by reference herein) means. In other embodiments, thetranscript information can be obtained from the media program providerin a separate file.

FIG. 12A is a diagram presenting exemplary method steps that can be usedto generate the textural transcript. In block 1202, the media program isplayed. This can be accomplished by a media player 420. In block 1204,the closed captioning information of the played media program isdecoded, for example, by use of decoder 422. In block 1206, the decodedclosed captioning (CC) information is associated with a temporal indexof the media program. The temporal index will typically represent time,but it can represent frames, or blocks, or any other value that is areasonable proxy for time. In block 1208, a textural transcript isgenerated from the decoded CC and the temporal index. This informationis stored, for example, in database 312. In one embodiment, theinformation is transmitted to the web server 114 for storage.

In one embodiment, the user 132 may perform keyword searches of thetranscript to identify portions of the media program that might be ofinterest.

FIG. 12B is a diagram presenting exemplary steps that can be used toperform a keyword search of the transcript 1104 of the media program.First, a search query, comprising a keyword is accepted, as shown inblock 1202. This can be accomplished, for example, by entering thekeywords in the search box 1112 shown in FIG. 11. The texturaltranscript 1104 is then searched for the entered keywords, as shown inblock 1204. The results can be presented by annotating the histogram 700to illustrate which segments include textural information matching theentered keyword or keyword combinations, by annotating the transcript1104 to show phrases where the keywords or keyword combinations arefound, or both, as shown in blocks 1206 and 1208.

FIG. 13 is a diagram illustrating the user interface shown in FIG. 11after the entry and search for keywords. In the example shown, thekeyword “disease” was entered into a text entry box 1012, and a searchperformed. The transcript 1104 now shows the phrases in the mediaprogram in which the keyword was used, with the keyword itselfhighlighted. Representative thumbnails 1106 for the segments of themedia program that include the keyword in the dialog can also bepresented proximate the depicted portion of the transcript. Also,histogram 700 now shows bars 1302 representing those segments orportions of the media program where the keyword “disease” is included inthe dialog. The user 132 may now navigate to different portions of themedia program in which the dialog includes the word “disease” byselecting the highlighted bars 1302, the thumbnails 1106, or the “playthis” control associated with each phrase.

The foregoing operations may be performed by one or more hardware orsoftware modules implemented in the client computer 102, web server 406,and/or the media server 112. For example, the client computer 102 canimplement one or more hardware or software module that accepts the userinput, generates the required information to display to the user basedon data received from the web server 114, and renders the informationfor display. Alternatively, the web server 114 may implement one or morehardware or software modules that accept the a message having the userinput, generate the required information to display to the user 132, andtransmit a message to the client computer 102, where the information canbe displayed.

The functions and functionality described above can be implemented usinghardware modules dedicated to the indicated functionality, or can beimplemented using software or firmware modules having instructions beingexecuted by processors. Furthermore, while the above disclosure presentsembodiments in which functionality has been allocated to differenthardware and/or software elements, the present invention can beimplemented using different functional allocations as well. For example,although FIG. 4 depicts a web server 114 and a media server 112,embodiments may be practiced in which the functions performed by the webserver 114 and media server 112 are a single entity. Further, otherembodiments are possible wherein functions depicted as being allocatedto one entity can be allocated to another. For example, the clientcomputer 102 may store only sufficient user interface data to permitdisplay of the user interface elements to the user, and rely on requeststo the web server to obtain data in response to user input.Alternatively, the user interface data transmitted to the clientcomputer 102 may include sufficient data to allow synchronizationbetween all histogram and transcript for all portions of the mediaprogram without the need for further data from the web server 114 or themedia server 112. Accordingly, a process step of accepting a user inputmay refer to the acceptance of the user input in the client computer102, or of the acceptance of a message from the client computer 102 bythe web server 114 or media server 112, wherein the accepted messageincludes user input.

FIG. 14 is a diagram of the histogram portion of another embodiment ofthe user interface 1110. In this embodiment, the user 132 can tagselected segments with keywords. This feature is useful in that itallows the user 132 or other users to find segments of interest. FIG. 14illustrates on embodiment of a histogram 700, showing the relativepopularity of segments of a media program. A keyword search or otheractivity has identified some of the segments as more relevant thanothers, and these segments are delineated (for example, by darkeningthem) from the other segments. The user 132 can provide a tag command tothe client computer 102 to select one or more of the segments fortagging. This can be accomplished, for example, by placing the cursor902 over features 1402A and the three features 1404A and clicking thepointing device 216. In response, respective windows 1402 and 1404 areopened. The user may then enter text and/or symbols in the windows 1402and 1404 to enter keyword(s). The entered text can be accepted byclicking away from the histogram 700.

Tags may be searched using a procedure analogous to that which isdescribed above with respect to keyword searches of the transcript. Whena tag keyword search is completed, the histogram 700 updated to indicatewhich segments of the media program have dialog that includes thesearched tag keywords.

In one embodiment, entered tags may be locally stored in a memory of theclient computer 102, so that they may be kept private, yet stillsearchable to find selected segments of the media program. In anotherembodiment, entered tags are transmitted to the web server 114 forstorage in a database, thus enriching the information available in thedatabase. This database can be made available for keyword searches byother client computers 102. In a further embodiment, whether tags aremade available for searching by others can be controlled by the user 132that entered the tag, with some tags shared and others held private.

For example, if user 132A tags the histogram from media program A, thattag may be entered into a web server 110 database and available forsearching by user 132B using client computer 102B. Since the tag isassociated with one or more media program segments, the information canbe used to identify those portions of the media program that areresponsive to the keyword. For example, if one user 132A entered thekeyword “chase” to refer to media program 1404A and that tag is enteredinto the database for use in searches by other users, if user 132Benters a keyword search using the keyword “chase” either to search thesame media program, a collection of media programs, or all availablemedia programs, the portion of the media program represented by sections1404A may be returned to that user 132B. The search results may bepresented in a variety of different ways, including presentation eachmedia program responsive to the keyword, along with a thumbnail and theportion of the media program that included the keyword (for example, theelapsed time when the portion begins and ends). For example, if user132B performs a keyword search using the term “chase” the media programfor which the histogram is pictured in FIG. 14 may be shown, the resultmay be the name of the media program, the portion of the media programassociated with the keyword (in the illustrated embodiment, 31:00-32:00minutes), and a thumbnail of that portion of the media program (e.g. aframe selected from that portion of the media program).

In one embodiment, the histogram 700 is also used to identify portionsof the media program that the user 132 would like to embed into awebpage. This can be accomplished by accepting a command identifying oneor more of the features of the histogram representing the desiredportion of the media program, and automatically generating instructionsfor embedding the segments represented by the identified features into awebpage.

FIG. 15 is a diagram presenting an exemplary user interface 1500 forcreating code that can be used to embed a selected portion of the mediaprogram into a webpage. In the user interface 1500 a reproduction of thehistogram 700 is presented, along with a control bar 1502. Control bar1502 also includes delimiters 1504 that the user can select and slidealong the control bar to indicate which segments of the media program(as represented by bars 902) that the user would like to embed into awebpage. Since the histogram 700 graphically illustrates the mostpopular portions of the media program, the it is easier for the user 132to identify the desired segments. As described before, thumbnails may bedisplayed when the user hovers the cursor over the bars 902, allowingthe user to preview the video content of the segment. Further, when adelimiter 1504 is moved to a location on the control bar 1502, athumbnail portion 1508 of the user interface 1500 may be updated to showa thumbnail from the segment of the new location. The user may alsocreate a custom thumbnail. Source code 1506 for embedding into thewebpage is automatically generated in response to the entered positionof the delimiters 1504 along the control bar 1502, and may be copied tothe clipboard by selecting a single control in the user interface 1500.

Conclusion

This concludes the description of the preferred embodiments of thepresent invention. The foregoing description of the preferred embodimentof the invention has been presented for the purposes of illustration anddescription. It is not intended to be exhaustive or to limit theinvention to the precise form disclosed. Many modifications andvariations are possible in light of the above teaching. It is intendedthat the scope of the invention be limited not by this detaileddescription, but rather by the claims appended hereto. The abovespecification, examples and data provide a complete description of themanufacture and use of the composition of the invention. Since manyembodiments of the invention can be made without departing from thespirit and scope of the invention, the invention resides in the claimshereinafter appended.

What is claimed is:
 1. A method of navigating a video program having aspoken dialog, comprising the steps of: generating a full textualtranscript of the spoken dialog, the full textual transcript comprisinga plurality of portions wherein each portion is associated with asegment of the video program; synchronizing each portion of the fulltextual transcript with the associated segment of the video program;transmitting user interface data including the full textual transcriptto the client computer for presentation in a user interface, the userinterface comprising a concurrently presented video program playerplaying the video program and a window configured to include portions ofthe full textual transcript; accepting a selection for a portion of thefull textual transcript included in the window of the user interface;determining the associated segment for the portion of the full textualtranscript based on the synchronizing; and dynamically navigating to thesegment of the video program associated with the selected portion of thefull textual transcript in the video program player in response to theaccepted selection.
 2. The method of claim 1, wherein the video programis encoded with closed captioning information and the step of generatinga full textual transcript of the spoken dialog of the video programcomprises the steps of: playing the video program; decoding the closedcaptioning information of the played video program; associating thedecoded closed captioning information with a temporal index of the videoprogram; and generating the full textual transcript from the decodedclosed captioning information and the temporal index.
 3. The method ofclaim 1, further comprising the steps of: selecting a frame from asegment of the video program; generating a thumbnail from the selectedframe of the segment; associating the generated thumbnail with theportion of the full textual transcript associated with the segment ofthe video program; and adding the generated thumbnail to the fulltextual transcript.
 4. The method of claim 3, wherein the generatedthumbnail is displayed proximate to the portion of the full textualtranscript associated with the segment of the video program.
 5. Themethod of claim 1, wherein the user interface data further comprises ahistogram having a plurality of features, each feature representing atleast one of the segments of the video program and indicating how oftenthe represented at least one of the segments of the video program hasbeen viewed by a second client computer and a third client computer. 6.The method of claim 5, wherein the histogram is generated by performingsteps comprising the steps of: streaming at a first set of the videoprogram segments to a first client computer; streaming a second set ofthe video program segments to a second client computer; and generatingthe histogram from the combined first set of video program segments andthe second set of video program segments.
 7. The method of claim 5,further comprising the steps of: accepting a first user input in theclient computer to select one of the features; and displaying theportion of the full textual transcript associated with the segmentrepresented by the selected feature.
 8. The method of claim 5, furthercomprising the steps of: accepting a search query having a keyword;searching the full textual transcript for a keyword; and annotating thefull textual transcript to delineate the video program segments havingdialog including the keyword.
 9. The method of claim 5, furthercomprising: displaying the histogram concurrently with the video playerand the window including portions of the full textual transcript,wherein the plurality of features of the histogram are synchronized withportions of the full textual transcript; accepting a selection of afeature of the plurality of features; determining a correspondingportion of the full textual transcript; and displaying the correspondingportion of the full textual transcript in response to receiving theselection of the feature.
 10. The method of claim 1, wherein the portioncomprises a phrase.
 11. A non-transitory computer-readable storagemedium containing instructions for navigating a video program having aspoken dialog, the instructions for controlling a computer system to beoperable for: generating a full textual transcript of the spoken dialog,the full textual transcript comprising a plurality of portions whereineach portion is associated with a segment of the video program;synchronizing each portion of the full textual transcript with theassociated segment of the video program; transmitting user interfacedata including the full textual transcript to the client computer forpresentation in a user interface, the user interface comprising aconcurrently presented video program player playing the video programand a window configured to include portions of the full textualtranscript; accepting a selection for a portion of the full textualtranscript included in the window of the user interface; determining theassociated segment for the portion of the full textual transcript basedon the synchronizing; and dynamically navigating to the segment of thevideo program associated with the selected portion of the full textualtranscript in the video program player in response to the acceptedselection.
 12. The non-transitory computer-readable storage medium ofclaim 11, wherein the video program is encoded with closed captioninginformation and the means for generating a full textual transcript ofthe dialog of the video program comprises: playing the video program;decoding the closed captioning information of the played video program;associating the decoded closed captioning information with a temporalindex of the video program; and generating the full textual transcriptfrom the decoded closed captioning information and the temporal index.13. The non-transitory computer-readable storage medium of claim 11,further operable for: selecting a frame from a segment of the videoprogram; generating a thumbnail from the selected frame of the segment;associating the generated thumbnail with the portion of the transcriptassociated with the segment of the video program; and adding thegenerated thumbnail to the full textual transcript.
 14. Thenon-transitory computer-readable storage medium of claim 13, wherein thegenerated thumbnail is displayed proximate to the portion of the fulltextual transcript associated with the segment of the video program. 15.The non-transitory computer-readable storage medium of claim 11, whereinthe user interface data further comprises a histogram having a pluralityof features, each feature representing at least one of the segments ofthe video program and indicating how often the represented at least oneof the segments of the video program has been viewed by a second clientcomputer and a third client computer.
 16. The non-transitorycomputer-readable storage medium of claim 15, wherein the histogram isgenerated by means operable for: streaming at a first set of the videoprogram segments to a first client computer; streaming a second set ofthe video program segments to a second client computer; and generatingthe histogram from the combined first set of video program segments andthe second set of video program segments.
 17. The non-transitorycomputer-readable storage medium of claim 15, further operable for:accepting a first user input in the client computer to select one of thefeatures; and displaying the portion of the full textual transcriptassociated with the segment represented by the selected feature.
 18. Thenon-transitory computer-readable storage medium of claim 15, furtheroperable for: accepting a search query having a keyword; searching thefull textual transcript for a keyword; and annotating the histogram todelineate the video program segments having dialog including thekeyword.
 19. The non-transitory computer readable storage medium ofclaim 15, further operable for: displaying the histogram concurrentlywith the video player and the window including portions of the fulltextual transcript, wherein the plurality of features of the histogramare synchronized with portions of the full textual transcript; acceptinga selection of a feature of the plurality of features; determining acorresponding portion of the full textual transcript; and displaying thecorresponding portion of the full textual transcript in response toreceiving the selection of the feature.
 20. The non-transitorycomputer-readable storage medium of claim 11, wherein the portioncomprises a phrase.
 21. An apparatus for navigating a video programhaving a spoken dialog, comprising: one or more computer processors; anda computer-readable storage medium comprising instructions forcontrolling the one or more computer processors to be operable for:generating a full textual transcript of the spoken dialog, the fulltextual transcript comprising a plurality of portions wherein eachportion is associated with a segment of the video program; synchronizingeach portion of the full textual transcript with the associated segmentof the video program; accepting a command to select the video programfrom a client computer; and transmitting user interface data includingthe full textual transcript to the client computer for presentation in auser interface, the user interface comprising a concurrently presentedvideo program player playing the video program and a window configuredto include portions of the full textual transcript; accepting aselection for a portion of the full textual transcript included in thewindow of the user interface; determining the associated segment for theportion of the full textual transcript based on the synchronizing; anddynamically navigating to the segment of the video program associatedwith the selected portion of the full textual transcript in the videoprogram player in response to the accepted selection.
 22. The apparatusof claim 21, further comprising: a video player playing the videoprogram; and a decoder for decoding the closed captioning information ofthe played video program, wherein the video program is encoded withclosed captioning information and the generating a full textualtranscript of the dialog of the video program comprises: associating thedecoded closed captioning information with a temporal index of the videoprogram, and for generating the full textual transcript from the decodedclosed captioning information and the temporal index.
 23. The apparatusof claim 22, further operable for: selecting a frame from a segment ofthe video program, generating a thumbnail from the selected frame of thesegment, associating the generated thumbnail with the portion of thetranscript associated with the segment of the video program, and addingthe generated thumbnail to the full textual transcript.
 24. Theapparatus of claim 23, wherein the generated thumbnail is displayedproximate to the portion of the full textual transcript associated withthe segment of the video program.
 25. The apparatus of claim 21, whereinthe user interface data further comprises a histogram having a pluralityof features, each feature representing at least one of the segments ofthe video program and indicating how often the represented at least oneof the segments of the video program has been viewed by a second clientcomputer and a third client computer.
 26. The apparatus of claim 25,further operable for generating the histogram from a combined first setof video program segments streamed to a first client computer and asecond set of video program segments streamed to a second computer. 27.The apparatus of claim 25, wherein the user interface accepts a firstuser input in the client computer to select one of the features anddisplays the portion of the full textual transcript associated with thesegment represented by the selected feature.
 28. The apparatus of claim25, wherein the user interface further accepts a search query having akeyword and annotates the histogram to delineate the video programsegments having dialog including a keyword based upon a search of thefull textual transcript for the keyword.
 29. The apparatus of claim 25,further operable for: displaying the histogram concurrently with thevideo player and the window including portions of the full textualtranscript, wherein the plurality of features of the histogram aresynchronized with portions of the full textual transcript; accepting aselection of a feature of the plurality of features; determining acorresponding portion of the full textual transcript; and displaying thecorresponding portion of the full textual transcript in response toreceiving the selection of the feature.